An old greek handwritten OCR system based on an efficient segmentation-free approach
Identifieur interne : 000F43 ( Main/Exploration ); précédent : 000F42; suivant : 000F44An old greek handwritten OCR system based on an efficient segmentation-free approach
Auteurs : K. Ntzios [Grèce] ; B. Gatos [Grèce] ; I. Pratikakis [Grèce] ; T. Konidaris [Grèce] ; S. J. Perantonis [Grèce]Source :
- International journal on document analysis and recognition : (Print) [ 1433-2833 ] ; 2007.
Descripteurs français
- Pascal (Inist)
- Wicri :
- topic : Classification.
English descriptors
- KwdEn :
Abstract
Recognition of Old Greek Early Christian manuscripts is essential for efficient content exploitation of the valuable Old Greek Early Christian historical collections. In this paper, we focus on the problem of recognizing Old Greek manuscripts and propose a novel recognition technique that has been tested in a large number of important historical manuscript collections which are written in lowercase letters and originate from St. Catherine's Mount Sinai Monastery. Based on an open and closed cavity character representation, we propose a novel, segmentation-free, fast and efficient technique for the detection and recognition of characters and character ligatures. First, we detect open and closed cavities that exist in the skeletonized character body. Then, the classification of a specific character or character ligature is based on the protrusible segments that appear in the topological description of the character skeletons. Experimental results prove the efficiency of the proposed approach.
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream PascalFrancis, to step Corpus: 000321
- to stream PascalFrancis, to step Curation: 000465
- to stream PascalFrancis, to step Checkpoint: 000286
- to stream Main, to step Merge: 000F57
- to stream Main, to step Curation: 000F43
Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">An old greek handwritten OCR system based on an efficient segmentation-free approach</title>
<author><name sortKey="Ntzios, K" sort="Ntzios, K" uniqKey="Ntzios K" first="K." last="Ntzios">K. Ntzios</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Department of Informatics and Telecommunications, National and Kapodistrian University of Athens</s1>
<s2>Athens</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>Athens</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Gatos, B" sort="Gatos, B" uniqKey="Gatos B" first="B." last="Gatos">B. Gatos</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Research Center "Demokritos"</s1>
<s2>153 10 Athens</s2>
<s3>GRC</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>153 10 Athens</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Pratikakis, I" sort="Pratikakis, I" uniqKey="Pratikakis I" first="I." last="Pratikakis">I. Pratikakis</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Research Center "Demokritos"</s1>
<s2>153 10 Athens</s2>
<s3>GRC</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>153 10 Athens</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Konidaris, T" sort="Konidaris, T" uniqKey="Konidaris T" first="T." last="Konidaris">T. Konidaris</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Research Center "Demokritos"</s1>
<s2>153 10 Athens</s2>
<s3>GRC</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>153 10 Athens</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Perantonis, S J" sort="Perantonis, S J" uniqKey="Perantonis S" first="S. J." last="Perantonis">S. J. Perantonis</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Research Center "Demokritos"</s1>
<s2>153 10 Athens</s2>
<s3>GRC</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>153 10 Athens</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">07-0469289</idno>
<date when="2007">2007</date>
<idno type="stanalyst">PASCAL 07-0469289 INIST</idno>
<idno type="RBID">Pascal:07-0469289</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000321</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000465</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000286</idno>
<idno type="wicri:doubleKey">1433-2833:2007:Ntzios K:an:old:greek</idno>
<idno type="wicri:Area/Main/Merge">000F57</idno>
<idno type="wicri:Area/Main/Curation">000F43</idno>
<idno type="wicri:Area/Main/Exploration">000F43</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">An old greek handwritten OCR system based on an efficient segmentation-free approach</title>
<author><name sortKey="Ntzios, K" sort="Ntzios, K" uniqKey="Ntzios K" first="K." last="Ntzios">K. Ntzios</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Department of Informatics and Telecommunications, National and Kapodistrian University of Athens</s1>
<s2>Athens</s2>
<s3>GRC</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>Athens</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Gatos, B" sort="Gatos, B" uniqKey="Gatos B" first="B." last="Gatos">B. Gatos</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Research Center "Demokritos"</s1>
<s2>153 10 Athens</s2>
<s3>GRC</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>153 10 Athens</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Pratikakis, I" sort="Pratikakis, I" uniqKey="Pratikakis I" first="I." last="Pratikakis">I. Pratikakis</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Research Center "Demokritos"</s1>
<s2>153 10 Athens</s2>
<s3>GRC</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>153 10 Athens</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Konidaris, T" sort="Konidaris, T" uniqKey="Konidaris T" first="T." last="Konidaris">T. Konidaris</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Research Center "Demokritos"</s1>
<s2>153 10 Athens</s2>
<s3>GRC</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>153 10 Athens</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Perantonis, S J" sort="Perantonis, S J" uniqKey="Perantonis S" first="S. J." last="Perantonis">S. J. Perantonis</name>
<affiliation wicri:level="1"><inist:fA14 i1="02"><s1>Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Research Center "Demokritos"</s1>
<s2>153 10 Athens</s2>
<s3>GRC</s3>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
</inist:fA14>
<country>Grèce</country>
<wicri:noRegion>153 10 Athens</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series><title level="j" type="main">International journal on document analysis and recognition : (Print)</title>
<title level="j" type="abbreviated">Int. j. doc. anal. recognit. : (Print)</title>
<idno type="ISSN">1433-2833</idno>
<imprint><date when="2007">2007</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">International journal on document analysis and recognition : (Print)</title>
<title level="j" type="abbreviated">Int. j. doc. anal. recognit. : (Print)</title>
<idno type="ISSN">1433-2833</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Character recognition</term>
<term>Classification</term>
<term>Greek</term>
<term>Handwriting recognition</term>
<term>Image processing</term>
<term>Letter</term>
<term>Manuscript character</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Skeleton</term>
<term>Topology</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Caractère manuscrit</term>
<term>Reconnaissance caractère</term>
<term>Reconnaissance optique caractère</term>
<term>Reconnaissance forme</term>
<term>Traitement image</term>
<term>Classification</term>
<term>Topologie</term>
<term>Squelette</term>
<term>Grec</term>
<term>Lettre alphabet</term>
<term>Reconnaissance écriture</term>
<term>.</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr"><term>Classification</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Recognition of Old Greek Early Christian manuscripts is essential for efficient content exploitation of the valuable Old Greek Early Christian historical collections. In this paper, we focus on the problem of recognizing Old Greek manuscripts and propose a novel recognition technique that has been tested in a large number of important historical manuscript collections which are written in lowercase letters and originate from St. Catherine's Mount Sinai Monastery. Based on an open and closed cavity character representation, we propose a novel, segmentation-free, fast and efficient technique for the detection and recognition of characters and character ligatures. First, we detect open and closed cavities that exist in the skeletonized character body. Then, the classification of a specific character or character ligature is based on the protrusible segments that appear in the topological description of the character skeletons. Experimental results prove the efficiency of the proposed approach.</div>
</front>
</TEI>
<affiliations><list><country><li>Grèce</li>
</country>
</list>
<tree><country name="Grèce"><noRegion><name sortKey="Ntzios, K" sort="Ntzios, K" uniqKey="Ntzios K" first="K." last="Ntzios">K. Ntzios</name>
</noRegion>
<name sortKey="Gatos, B" sort="Gatos, B" uniqKey="Gatos B" first="B." last="Gatos">B. Gatos</name>
<name sortKey="Konidaris, T" sort="Konidaris, T" uniqKey="Konidaris T" first="T." last="Konidaris">T. Konidaris</name>
<name sortKey="Perantonis, S J" sort="Perantonis, S J" uniqKey="Perantonis S" first="S. J." last="Perantonis">S. J. Perantonis</name>
<name sortKey="Pratikakis, I" sort="Pratikakis, I" uniqKey="Pratikakis I" first="I." last="Pratikakis">I. Pratikakis</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000F43 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000F43 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= Pascal:07-0469289 |texte= An old greek handwritten OCR system based on an efficient segmentation-free approach }}
This area was generated with Dilib version V0.6.32. |